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ABSTRACT 

The cDNA encoding digalactosyldiacylglycerol galactosy transferase (DGD1) 
is provided. The deduced amino acid sequence is also provided. Methods of 
making and using DGD1 to screen for new herbicides and alter a plant's leaf lipid 
5 composition are also provided, as well as expression vectors, transgenic plants or 
other organisms transf ected with said vectors. 
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DNA ENCODING FOR PLANT DIGALACTOSYLDIACYLGLYCEROL 
GALACTOSYLTRANSFERASE AND METHODS OF USE 

5 

FIELD OF THE INVENTION 

The invention relates generally to plant galactolipids and more particularly 

10 the gene encoding digalactosyldiacylglycerol galactosyltransferase. 

BACKGROUND OF THE INVENTION 
The process of photosynthesis is the basis for all life on earth because it 
provides oxygen and ultimately converts inorganic matter into organic matter. The 
photosynthetic apparatus in plant cells is associated with a particular membrane 

15 system inside chloroplasts, the thylakoids. Four lipids are found to be associated 
with thylakoid membranes in plants and photosynthetic bacteria. Only one of 
them is a phospholipid, the ubiquitous phosphatidylglycerol. The other three are 
non-phosphorous diacylglycerol glycolipids with one or two galactose moieties or 
a sulfonic acid derivative of glucose attached to diacylglycerol. Browse, J. et al., 

20 Ann. Rev. Plant Physiol. Plant Mol. Biol. 42:467 (1991); Joyard, J. et al. f Plant 
Physiol. 118:715 (1998). The galactolipids constitute the bulk (close to 80%) of 
the thylakoid lipid matrix and within green plant parts, 70-80% of the lipids are 
associated with photosynthetic membranes. Taking into account that plants 
represent the major portion of the global bioorganic matter, it comes as no 

25 surprise that the two galactolipids, mono- and digalactosyldiacylglycerol, are the 
most abundant lipids in the biosphere. Most vegetables and fruits in human or 
animal diets are rich in galactolipids. Their breakdown products represent an 
important dietary source of galactose and polyunsaturated fatty acids. Ohlsson, 
L. et al., J. Nutrition 128:239 (1998); Andersson, L et al., J. Lipid Res. 36:1392 

30 (1995). The elucidation of the pathway for galactolipid biosynthesis has been 
extremely challenging. Thylakoid membrane lipid biosynthesis in plants is highly 
complex bringing together carbohydrate and fatty acid metabolisms. There is a 
mesmerising number of molecular species for each thylakoid lipid due to the large 
number of combinatorial possibilities for fatty acid substituents. Even more 

35 dazzling, the biosynthesis of thylakoid lipids is not restricted to enzymes 
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associated with the chloroplast where galactolipids are found, but also involves 
enzymes in the endoplasmic reticulum (ER) (Figure 1), The mechanism for 
subcellular trafficking of lipid moieties from the ER that ultimately become 
incorporated into the thylakoid lipids inside the plastids poses one of the most 
5 challenging enigmas of modern plant biochemistry. Molecular species of 
galactolipids containing diacylglycerol moieties derived from the plastid or the ER 
pathway can be distinguished based on their fatty acid composition. Heinz, E. et 
al., Plant Physiol. 72:273 (1983). Lipid moieties assembled inside the plastid 
carry preferentially a 16-carbon fatty acid in the s/?2-position of diacylglycerol, 

10 while lipids derived from the ER pathway contain an 18-carbon fatty acid in this 
position. This is due to different substrate specificities of the respective 
acyltransferases in the plastid and the ER. An extensive screening of different 
plant species revealed that the plastid pathway is dispensable in many plants. 
Mongrand, S. et al., Phytochemistry 49:1049 (1998). However, no naturally 

15 occurring plant has been found, in which the ER pathway was non-functional. A 
mutant of Arabidopsis, actl, is partially blocked in the plastid pathway, Kunst, 
L. et al., PNAS (USA) 85:4143 (1988). This mutant is deficient in the 
acyltransferase which catalyses the biosynthesis of lysophosphatidic acid inside 
the plastid (Figure 1). Other mutants of Arabidopsis have been described that 

20 affect the fatty acid and, thus, the molecular species composition of thylakoid 
lipids. Browse, J. et al., in Arabidopsis, E.M. Meyerowitz and C.R. Somerville, 
Eds. (Cold Spring Harbor Laboratory Press, New York) pp. 881-912 (1994). 
Most of these are deficient in fatty acid desaturases. However, the only higher 
plant mutant known to be directly affected in galactolipid assembly is the dgdJ 

25 mutant of Arabidopsis. Dormann, P. et al., Plant Cell 7:1801 (1995). In this 
mutant the relative amount of the digalactosyl lipid is reduced to 10% of wild 
type. It has already proven to be very valuable in assessing the importance of the 
digalactosyl lipid for the assembly and function of the photosynthetic 
membranes. Growth, chloroplast ultra structure, the composition and relative 

30 ratios of different pigment protein complexes, the light utilization by the 
photosynthetic apparatus, and the import of proteins into chloroplasts are 
affected in the dgdf mutant. Hartel, H. et al., Plant Physiol. 115:1175 (1997); 
Reifarth, F. et al. r Biochemistry 36:1 1769 (1997); Hartel, H. et al., Plant Physiol. 
Biochem. 36:407 (1998); Chen, L.-J. et al., Plant J. 16:33 (1998). In addition to 

35 the reduction in the amount of galactolipid, the dgdl mutant also shows a 
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peculiar alteration in the fatty acid composition of the monogalactosyl lipid with 
a characteristic increase in the amount of molecular species containing 18- 
carbon fatty acids, the accumulation of these molecular species of the 
monogalactosyl lipid is consistent with their presumed precursor function in the 
5 biosynthesis of the digalactosyl lipid. Based on labelling experiments with 
isolated chloroplasts (van Besouw, A. et al., Biochim. Biophys. Acta 529:44 
(1978); Hemmskerk, J.W.M. et al., Plant Physiol. 93:1286 (1990)), it has been 
proposed that one galactose moiety is transferred from one monogalactosyl lipid 
onto a second to form the digalactosyl lipid (Figure 1). The released 

10 diacylglycerol moiety is made available for further thylakoid lipid assembly with 
the bulk appearing in monogalactosyl lipid. As can be assumed from the fatty 
acid composition of the digalactosyl lipid in the wild type (Browse, J. et al, 
Biochem. J. 235:25 (1986)), the responsible enzyme is specific for molecular 
species derived from the ER. Accordingly, approximately equal amounts of ER- 

1 5 derived molecular species are found in the digalactosyl and monogalactosyl lipids 
(Figure 1). Therefore, it is expected that the disruption of digalactosyl lipid 
biosynthesis in the dgdl mutant also disturbs the assembly of other thylakoid 
lipids, in particular the ER-derived monogalactosyl lipid. 

It would thus be desirable to provide the wild-type DGD1 gene encoding 

20 for digalactosyldiacylglycerol galactosyltransf erase (DGD1). It would also be 
desirable to isolate and purify the gene product. It would be further desirable to 
provide in vitro and in vivo assays to screen for new herbicides that inhibit the 
DGD1 gene product. Galactolipids are unique to plants and other photosynthetic 
organisms. Therefore, in contrast to most herbicides currently in use f herbicides 

25 that inhibit galactolipid biosynthesis will not be toxic to animals, humans or 
microbial organisms in the soil. 

It would also be desirable to control the digalactosyldiacylglycerol levels in 
plants by controlling the expression of the gene encoding for the DGD1 protein. 
It would further be desirable to transform plants using the gene in order to alter 

30 their lipid composition. An alteration in lipid composition would provide plants 
with an increased resistance to environmental factors such as, but not limited to f 
temperature stress and/or pathogen infection. It would further provide an 
increase in the yield of crop plants such as leafy vegetables. 
SUMMARY OF THE INVENTION 

35 The present invention provides a novel purified and isolated nucleic acid 
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sequence encoding digalactosyldiacylglycerol galactosyltransferase (DGD1 ). The 
cDNA encoding DGD1 is set forth SEQ ID NO: 1. The deduced amino acid 
sequence of DGD1 is also provided and set forth in SEQ ID NO: 2. The protein has 
a predicted molecular weight of 91.8 kDa and has some sequence similarity in the 
5 C-terminal portion to bacterial and plant glycosyltransferases. 

Methods for making and using the cDNA encoding DGD1 are also provided. 
For example, wild-type DGD1 can be used to produce recombinant DGD1 in 
bacteria or yeast. Such recombinant protein can be used in either an in vivo or in 
vitro assay to screen compounds for new herbicides. Additionally, DGD1 may be 
10 used to alter a plant's leaf lipid composition thus altering sensitivity to 
environmental factors such as, but not limited to, temperature stress and/or 
pathogen infection and, in some cases, increase the yield of crop plants. 
Expression vectors containing the cDNA, transgenic plants and other organisms, 
e.g., £ coii t transfected with said vectors, as well as seeds from said plants, are 
1 5 also provided by the present invention. 

Additional aspects, advantages, and features of the present invention will 
become apparent from the following description, taken in conjunction with the 
accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 
20 The various advantages of the present invention will become apparent to 

one skilled in the art by reading the following specification and by referencing the 
following drawings in which: 

Figure 1 is a schematic illustrating galactolipid biosynthesis in Arabidopsis) 
Figure 2 is a photograph of four-week old Arabidopsis plants showing the 
25 appearance of the wild type, act1, dgdJ and acf 7, dgdl double mutants; 

Figures 3A-3D are schematics illustrating the map-based cloning of the 
DGD 1 gene; 

Figure 4 is a photograph of a thin-layer chromatograph showing 
reconstitution of the plant galactolipid biosynthetic pathway in E coff. 
30 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The cDNA sequence encoding digalactosyldiacylglycerol 
galactosyltransferase (DGD1) is set forth in SEQ ID NO: 1. The deduced amino 
acid sequence is provided in SEQ ID NO: 2. The protein has a predicted mass of 
91.8 kDa. Sequence comparisons show some similarity in the C-terminal portion 
35 to bacterial and plant glycosyltransferases. 
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A method for producing DGD1 in a host cell is also provided in the present 
invention. The method includes the steps of introducing an expression vector 
comprising a cDNA encoding DGD1 or a functional mutant thereof into a host cell 
and expressing the cDNA in an amount sufficient to permit purification of the 
5 DGD1 . A vector may include a promoter that is functional in either a eukaryotic or 
prokaryotic cell. Preferably, the vector is introduced into a prokaryotic cell, such 
as £ coli that is routinely used for production of recombinantly produced proteins. 
Alternatively, the vector is introduced into a eukaryotic cell, such as 
Saccharomyes cerevisiae (yeast), that is also routinely used for the production of 

10 recombinantly produced proteins. It is further contemplated that DGD1 may be 
manufactured using standard synthetic methods. 

The availability of large amounts of recombinant protein will permit the 
rapid screening of compounds to identify new herbicides. It will be appreciated 
that either a cell lysate, partially purified or purified recombinant DGD1 can be used 

15 in both in vitro and in vivo screening assays. It will also be appreciated that 
purified protein from a plant such as Arabidopsis is also contemplated within the 
present invention, and can also be used to screen for new compounds. For 
example, galactosyltransferase activity assay is provided wherein the amount of 
digalactosyldiacylglycerol (DGD) produced in £ coli expressing recombinant DGD1 

20 and monogalactosyldiacylglycerol synthase is determined by thin layer 
chromatography. Thus, in a preferred embodiment, the polar lipids are extracted 
from the £ coli cells with one volume of 1M KCl and 0.2M H3PO4 and 2 volumes 
of methanol/chloroform (1:1, v/v). In another embodiment, the extracted polar 
lipids are separated by thin layer chromatography on ammonium sulfate- 

25 impregnated silica plates developed in acetone/toluene/water (90:30:8, v/v/v/). 
DGD lipid is then visualized by staining with cc-naphthol. It will be appreciated that 
£ coli expressing recombinant DGD1 can be exposed to various compounds and 
the effect of such treatment on DGD production assessed. 

Once a compound is identified as an inhibitor of DGD1, mutagenesis can be 

30 used to create DGD1 mutants that show decreased or no sensitivity to the 
inhibitory compound. DGD1 mutants can be made by known methods such as, 
but not limited to, site-directed mutagenesis or random mutagenesis, followed by 
screening for an active DGD1 mutant. It will be appreciated that such a mutant 
gene would be suitable for overexpression in crop plants, conferring resistance to 

35 the selected inhibitor compound. 
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Furthermore, sequences of the present invention may be used to alter a 
plant's leaf lipid composition, Naturally-selected mutants of Arabidopsis with 
either decreased or increased expression of DGD1 show altered lipid and fatty acid 
composition. Altering a plants leaf lipid composition can increase a plant's 
5 resistance to environmental factors such as, but not limited to, heat and/or cold 
stress, increase resistance to pathogen infection, and/or increase crop yield, 
especially of leafy vegetables such as lettuce. The method of altering leaf lipid 
composition of a plant includes the steps of introducing an expression vector 
comprising a cDNA encoding DGD1 or a functional mutant thereof, operably linked 

10 to a promoter functional in a plant cell into the cells of plant tissue and expressing 
the encoded protein in an amount effective to alter the leaf lipid composition. The 
level of expression can be increased by either combining the cDNA with a 
promoter that provides for a high level of expression, or by introducing multiple 
copies into the cell so that multiple copies are integrated into the genome of 

1 5 transformed plant cells. Once transformed cells exhibiting increased DGD1 activity 
are obtained, transgenic plants and seeds can then be regenerated therefrom, and 
evaluated for the stability of the inheritance of altered leaf lipid composition. 

The DGD1 nucleotide sequence may thus be fused to a gene or fragment 
thereof, which allows it to be expressed in a plant cell. The DGD1 nucleotide 

20 sequence in combination with the gene or gene fragment, is referred to as an 
"expression vector" herein. It will be appreciated that the expression vectors of 
the present invention may contain any regulatory elements necessary and known 
to those skilled in the art for expression of DGD1 . For example, such vectors may 
contain, but are not limited to, sequences such as promoters, operators and 

25 regulators, which are necessary for, and/or may enhance, the expression of DGD1 . 

The invention also provides the nucleic acid sequence from a shorter gene 
on chromosome 4 of Arabidopsis tha/iana that has a high sequence similarity to 
DGD1 {blast P score of 360). The nucleic acid sequence, DGD2, is set forth in 
SEQ ID NO: 3 (GenBank Accession No, AF058919). The deduced amino acid 

30 sequence DGD2 is provided in SEQ ID NO: 4. The gene, DGD2 encodes a protein 
missing approximately 340 amino acids of the N-terminal portion as compared to 
DGD1, but shows similarity to the glycosyltransf erase-like sequence part of 
DGD1. The amino acid sequence homology between DGD1 and DGD2 is 64.4% 
over 365 amino acids. This high degree of homology indicates that DGD2 would 

35 have the same activity and a similar function as DGD1 in plants. Therefore, it 
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will be appreciated that the DGD2 gene can be used with the methods of the 
present invention. The predicted DGD1 protein contains an N-terminal transit 
peptide typical for proteins imported into the plastid. Furthermore, two strongly 
hydrophobic domains (anftino acids 347-372 and 644-670) were found in the 
5 sequence. While not wishing to be bound by theory, this observation agrees with 
a proposed association of DGD1 with the plastid envelope membranes. Block, 
M.A. et al., J. Biol. Chem. 258:13281 (1983); Cline, K. et al„ Plant Physiol. 
71:366 (1983); Dome, A.-J. et al., FEBS Lett. 145:30 (1982). 

As referred to herein, the term "cDNA" is meant a nucleic acid, either 

10 naturally occurring or synthetic, which encodes a protein product. The term 
"nucleic acid" is intended to mean natural and/or synthetic linear, circular and 
sequential arrays of nucleotides and nucleosides, e.g., cDNA, genomic DNA 
(gDNA), mRNA, and RNA, oligonucleotides, oligonucleosides, and derivatives 
thereof. The term "encoding" is intended to mean that the subject nucleic acid 

1 5 may be transcribed and translated into either the desired polypeptide or the subject 
protein in an appropriate expression system, e.g., when the subject nucleic acid is 
linked to appropriate control sequences such as promoter and enhancer elements in 
a suitable vector (e.g., an expression vector) and when the vector is introduced 
into an appropriate system or cell. As used herein, "polypeptide" refers to an 

20 amino acid sequence which comprises both full-length protein and fragments 
thereof. 

As referred to herein, the term "capable of hybridizing under high stringency 
conditions" means annealing a strand of DNA complementary to the DNA of 
interest under highly stringent conditions. Likewise, "capable of hybridizing under 

25 low stringency conditions" refers to annealing a strand of DNA complementary to 
the DNA of interest under low stringency conditions. In the present invention, 
hybridizing under either high or low stringency conditions would involve hybridizing 
a nucleic acid sequence [e.g., the complementary sequence to SEQ ID No. 1 or 
portion thereof), with a second target nucleic acid sequence, "High stringency 

30 conditions" for the annealing process may involve, for example, high temperature 
and/or lower salt content, which disfavor hydrogen bonding contacts among 
mismatched base pairs. "Low stringency conditions" would involve lower 
temperature, and/or higher salt concentration than that of high stringency 
conditions. Such conditions allow for two DNA strands to anneal if substantial, as 

35 is the case among DNA strands that code for the same protein but differ in 
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sequence due to the degeneracy of the genetic code. Appropriate stringency 
conditions which promote DNA hybridization, for example, 6X SSC at about 45° C, 
followed by a wash of 2X SSC at 50° C, are known to those skilled in the art or 
can be found in Current Protocols in Molecular Biology, John Wiley & Sons, NY 
5 (1989), 6.31-6.3.6. For example, the salt concentration in the wash step can be 
selected from a low stringency of about 2X SSC at 50° C, to a high stringency of 
about 0.2X SSC at 50° C. In addition, the temperature in the wash step can be 
increased from low stringency at room temperature, about 22" C, to high 
stringency conditions, at about 65° C. Other stringency parameters are described 

10 in Maniatis, T., et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory Press, Cold Spring NY, (1982), at pp. 387-389; see also Sambrook J. 
et al., Molecular Cloning: A Laboratory Manual, Second Edition, Volume 2, Cold 
Spring Harbor Laboratory Press, Cold Spring, NY at pp. 8.46-8.47 (1989). 

The foregoing and other aspects of the invention may be better understood 

1 5 in connection with the following examples, which are presented for purposes of 
illustration and not by way of limitation.* 

SPECIFIC EXAMPLE 1 

CONSTRUCTION OF act 1, dgdl DOUBLE MUTANT 

20 An act 1,dgd1 double mutant (Figure 1) was constructed. An F2 population 

of plants derived from the cross of act1 (Arabidopsis Biological Resource Center, 
Columbus, Ohio) and dgdl was screened by thin-layer and gas chromatography 
for the characteristic lipid and fatty acid phenotype anticipated for the double 
mutant. About 1/16 of the F2 plants contained strongly reduced amounts of 

25 7,10,1 3-hexadecatrienoic acid as found in the act1 mutant and reduced amounts 
of the galactolipid DGD, indicative for the dgdl mutation. This was the expected 
result for the segregation of two recessive unlinked mutations. Four-week-old 
representative plants raised on soil of the wild type (ecotype Col-2}, the single 
homozygous mutants actl and dgdl t and the act1,dgd1 double homozygous 

30 mutant are shown in Figure 2. The double mutant was severely stunted (Figure 
2), and showed a more extreme growth phenotype than each of the mutant 
parents and any other known lipid mutant of Arabidopsis. The fatty acid 
composition of the galactolipids of the mutants as well as their fraction of total 
polar lipids were measured in leaves obtained from tissue-culture grown plants 

35 by thin-layer chromatography and subsequent gas chromatography of fatty acid 
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methyl esters (Table 1). 



Monogalactosyldiacylglycerol 


Wild Type" 


dgd1 a 


acf1 a 


acthdgdf 


49.8 


53.5 


53.7 


44.2 


7,10,13-hexadecatrienoic acid 


28.9 


14.1 


0.8 


0.7 


a-linolenic acid 


62.5 


78.5 


88.5 


86.4 


Digalactosyldiacylglycerol 


12 


1.7 


15.9 


2.1 


7,10,13-hexadecatrienoic acid 


2.6 


2.8 


n.d. 


n.d. 


a-linolenic acid 


71.1 


41.2 


83.0 


44.6 



'The values are given in mol % of total polar leaf lipids for the two galactolipids 



and mol % of fatty acids attached to each of the two galactolipids for 7,10,13- 
5 hexadecatrienoic acid (0//-c/s-16:3A 7 ' 10 * 13 ) and a-linolenic acid (a//-c/s-1 8:3A 9,12,15 ). The 
values represent the means of three measurements each. The standard deviation was 
below 2.5% (galactolipids) and 1.0% (fatty acids). n.d., not detectable. 

Because the lipid and fatty acid composition of the double mutant was not more 

10 severe than that of either of the parents, it is unlikely that the extreme growth 
phenotype of the double mutant is due to specific lipid or fatty acid effects. 
Thus, it seems plausible to conclude that the double mutant cannot produce 
sufficient amounts of thylakoid membranes, because the plastid pathway is 
affected by the act1 mutation and the ER pathway by the dgdl mutation. A 

15 pathway model consistent with the available biochemical data and the single and 
double mutant phenotypes is shown in Figure 1 . For clarity, the model focuses 
on the galactolipids representing close to 80% of ail thylakoid lipids. At least two 
genes encoding putative monogalactosyl lipid synthases are present in 
Arabidopsis {Shimojima, M. et al. r PNAS (USA) 94:333 (1997); and it is 

20 proposed that these have different substrate specificities and different 
association with the inner or outer envelope in accordance with previous studies. 
Block, M.A. et al., J. Biol. Chem. 258:13281 (1983); Cline, K. et al., Plant 
Physiol. 71:366 (1983). This enzyme class is currently under investigation by 
others- Shimojima, M. et al., PNAS (USA) 94:333 (1997); Teucher, T. et al., 

25 Planta 184:319 (1991); Marechal, E. et al., Biol. Chem. 269:5788 (1994). 
Accordingly, a transient pool of monogalactosyl lipid is produced at the outer 
envelope from ER-derived diacylglycerol and immediately converted by DGD1. 
This process is accompanied by a transfer of lipid moieties from the outer to the 
inner envelope. In the absence of DGD1, monogalactosyl lipid cannot be 

30 efficiently synthesized via the ER-pathway but the plastid pathway can 
compensate for this deficiency. Only when both pathways are blocked as in the 
act1,dgd1 double mutant, the overall galactolipid biosynthesis is reduced to 
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levels insufficient to support growth. Apparently, the proposed initial 
biosynthesis of galactolipids at the outer envelope membrane cannot compensate 
for the biosynthesis via DGD1, but would explain the small amount of 
digalactosyl lipid and the altered molecular species composition of 
5 monogalactosyl lipid in the dgdl mutant. 

SPECIFIC EXAMPLE 2 
ISOLATION AND PURIFICATION OF THE GENE ENCODING DGD1 
While the analysis of the dgdl mutant and the act1,dgd1 double mutant 
in Specific Example 1 revealed the crucial role of DGD1 in galactolipid 

10 biosynthesis and subcellular lipid trafficking in higher plants, only the molecular 
and biochemical analysis of the dgdl gene product will yield a true understanding 
of the underlaying mechanism. The cloning of both the mutant dgdl and wild- 
type DGD1 genes represents the first step in this direction. Because no molecular 
information was available, the dgdl locus and the corresponding dgdl cDNA 

15 was isolated by a strategy based on the map position of dgdl. One of the 
difficulties encountered was the heterogeneous genetic background in the dgdl 
mutant with markers characteristic for ecotypes Col-2 or Ler found interspersed 
around the dgdJ locus. This problem was solved by integration of two different 
mapping populations derived from crosses of dgdl to Col-2 or Ler wild-type. 

20 From a total of 1 35 F 2 plants derived from the cross dgdl x Col-2, plants with 
cross-overs between the two PCR markers nga162 and nga172 were selected. 
Bell, C.J. et al., Genomics 19:137 (1994). In this F 2 population, the dgdl locus 
was mapped relative to the RFLP markers g4523, fad7, g4547, 5E-5 and 18A-1. 
Similarly, a total of 424 F 2 plants from the cross dgdl x Ler were screened for 

25 cross-overs between the PCR markers nga127 and ATHCHIB. This mapping 
population was used to score the RFLP markers g2488a and 31A-H. The RFLP 
markers were obtained from the Arabidopsis Biological Resource Center at 
Columbus, Ohio (g4523, fad7, g2488a, g4547} or from genomic fragments 
(31A-H, 5E-5, 18A-1) isolated from cosmid inserts in this study. Unambiguous 

30 scoring of the mutant phenotype had to be done by thin-layer chromatography of 
leaf lipid extracts, requiring several thousand samples to be processed during the 
fine mapping process. The map encompassing the DGD1 locus and the YAC, 
BAC, and cosmid contigs spanning the locus on chromosome three are shown in 
Figure 3. Figure 3 A shows the genetic map of the relevant part of Arabidopsis 

35 chromosome 3. Two YAC clones containing dgdl are shown. Numbers indicate 
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recombinations between a given marker and dgdl per number of chromosomes 
analyzed in the respective mapping population. Figure 3B shows the fine 
mapping between the markers fad7 and g2488a. The BAC (IGF clone#) and 
cosmid (C clone#) contigs are shown. Complementing clones are marked by ©, 

5 non-complementing by Figure 3C is a map of the cosmids C49B, C5A and C5E 
with H indicating Hinti\\\ restriction sites. Finally, Figure 3D shows the structure 
of the DGD1 gene and cDNA. The exons are shaded and numbered 1 to 7. The 
sequence predicted to be a chloroplast transit peptide {T) is indicated, as well as 
the part showing similarity to glycosyltransferases (GTF, cross hatched), the 

10 start and stop codon (ATG, TAG, respectively), and the C to T mutation 
observed in the dgdl mutant. Genomic DNA isolated from the markers fad7, 
g4547 and g2488a was used to isolate DNA fragments from different libraries 
(CIC Yeast Artificial Chromosome library (Camilleri, C. et aL, Plant J. 14:633 
(1998)); IGF Bacterial Artificial Chromosome library (Mozo, T. et al., Mol. Gen. 

15 Genet. 258:562 (1998)); Arabidopsis cosmid library (Meyer, K. et al., in Genome 
Mapping in Plants, A. H. Patterns, Ed. (Academic Press, New York, 1996), pp. 
137-154). Different cosmids harbouring inserts between T-DNA borders were 
tested for complementation. Because the dgdl mutant could not be transformed 
with these large genomic fragments, the clones were transferred into the wild 

20 type first and crossed the T-DNA into the dgdl mutant. Cosmid clones were 
transferred into Agrobacterium tumefaciens C58C1 (pGV2260) and used to 
transform Arabidopsis thaliana Col-2 wild type plants via vacuum infiltration 
(Bechtold, N. et al., Acad. Sci. Paris Life Sci. 316:1 194 (1993); Bent, A.F. et al., 
Science 265:1856 {1994)). Transformants were crossed with dgdl mutant 

25 plants and the segregation pattern in the F 2 generation was analyzed. 
Complementation was assumed when of 100 tested F 2 plants carrying the T- 
DNA all were phenotypically wild-type, whereas in non-complementing lines, a 
segregation of the wild-type versus mutant phenotype of 3 : 1 was expected. A 
minimum of 100 transgenic F 2 plants derived from each cross (1 to 3 

30 independent crosses per cosmid) were analyzed. To avoid the possibility that the 
construct was corrupted by chance in any particular plant, several independent 
transgenic lines were used. The analysis of three F 2 populations derived from 
crosses with independent lines containing cosmid C49B was consistent with 
genetic complementation by a gene encompassed by the insert. Several cosmids 

35 overlapping (C5A, C5E) or neighbouring C49B (C27A, C14B) did not complement 
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the mutation. Large portions of the cosmid C49B were sequenced (SEQ ID NO: 

5) . One putative gene was located in the centre of C49B, but was only partially 
contained by the cosmids C5A and C5E. Based on the complementation analysis 
for the cosmids C49B, CSA and C5E it was concluded that this gene represents 

5 the DGDJ locus. Therefore, the insert of C49B was used to screen a cDNA 
library. Uwer, U. et al. Plant Cell 10:1277 (1998). A 2683 bp long cDNA was 
identified and sequenced (Figures 3D and SEQ ID NO: 1). This cDNA appeared to 
be complete because it contained in-frame stop codons 5'-prime of a putative 
ATG start codon. The cDNA was inserted behind a CaMV 35S promoter and 

10 transferred directly into the mutant by Agrobacterium mediated in plants 
transformation. For direct complementation analysis, the DGD1 cDNA released 
from pBluescriptllSK( + ) with Smal, Xho\ was ligated into the Sma\, Sal\ sites of 
the binary vector pBINAR-Hyg (Becker, D., Nucl. Acids Res. 18:203 (1990)) in 
sense orientation behind the CaMV 35S promoter. This construct was directly 

1 5 transferred into the dgdl mutant via Agrobacterium by vacuum infiltration. Two 
transgenic plants were recovered which were phenotypically wild type with 
regard to habitus and lipid composition indicating complementation. To obtain 
corroborating evidence for complementation and to exclude the possibility of 
wild-type contamination, genetically homozygous dgdl plants were identified in 

20 each complementation experiment by DNA/DNA hybridization using the RFLP 
marker 5E-5 which scores identical in the Col-2 and Ler wild type background 
but different in the dgdl mutant. These plants were tested for lipids. With no 
exception, cosmid C49B and the DGDJ cDNA lead to wild-type lipid composition 
in all tested transgenic plants homozygous for dgdl. To obtain independent 

25 evidence for the identity of the DGD1 locus, the respective genomic DNA of the 
wild-type DGD1 and the mutant dgdl loci were sequenced (SEQ ID NOS: 5 and 

6) . Further comparison of the genomic and cDNA sequences revealed 7 exons 
and a transition of a CAA codon (glutamine) to TAA in axon 6 in the dgdl 
mutant gene leading to a premature stop codon. 

30 SPECIFIC EXAMPLE 3 

RECONSTITUTION OF THE PLANT GALACTO LIPID BIOSYNTHETIC 

PATHWAY IN £ COU 

The DGD1 cDNA is predicted to encode a 91.8 kDa protein with some 

sequence similarity in the C-terminal portion to bacterial and plant 
35 glycosyltransferases. To determine the biosynthetic activity of the wild-type 
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DGD1 gene product, the DGD I cDNA was expressed in £ coli along with the 
monogalactosyldiacylglycerol (MGD) synthase previously isolated from 
cucumber. Shimojima, M. et al., PNAS (USA) 94:333 (1997). A 459 bp Xhol, 
Pvull fragment including the expression cassette was isolated from pQE31 

5 (Qiagen Inc.) and ligated into the Sail, EcoRV sites of pACYC184 (Chang, A.C.Y. 
et al., J. Bacterid. 134:1141 (1978)) giving rise to the plasmid pACYC-31. The 
open reading frame of the DGD1 cDNA was amplified by PCR using the primers 
5'-GCGGATCCGGTAAAGGAAACTCTAATT-3' (Ben239; SEQ ID NO: 7) and 5'- 
TTCTGCAGTCTACCAGCCGAAGATTGG-3' (Ben241; SEQ ID NO: 8), thereby 

10 introducing a BamHI site at the 5' and a Pstl site at the 3' terminus. This cDNA 
fragment was ligated into the corresponding restriction sites of pACYC-31. The 
resulting vector, pACYC-31/239, was transferred into XL1-Blue cells carrying the 
expression vector pGEX-3X with the cucumber MGD synthase cDNA (Shimojima 
et al. 1998). The cells were grown and protein expression induced with IPTG. 

1 5 The QIA expressionist: A handbook for high-level expression and purification of 
6x His-tagged proteins. Qiagen, Inc., Valencia, CA (1997). The polar lipids were 
extracted from the £ coli cells with 1 volume 1 M KCI, 0.2 M H 3 P0 4 and 2 
volumes methanol/chloroform (1:1, v/v). Polar lipids were then separated by 
thin-layer chromatography on ammonium sulfate (0.15 M) impragnated Baker 

20 Si250PA silica plates developed in acetone/toluene/water (90:30:8, v/v/v). 
Digalactosyldiacylglycerol was visualized by staining with cc-naphthol. In addition 
to MGD a new glycolipid was observed (Lane 2, Figure 4) that co-migrates with 
an authentic digalactosyldiacylglycerol (DGD) standard (Lane 3, Figure 4). In 
contrast, when the MGD synthase gene but not the dgdl cDNA was present, no 

25 DGD was observed (Lane 1, Figure 4). The plant galactolipid biosynthetic 
pathway was therefore, reconstituted in £ . coli. Furthermore, this result 
demonstrates that the dgdl gene indeed encodes a DGD galactosyltransferase. 

A gene essential for the biosynthesis of the thylakoid lipid 
digalactosyldiacylglycerol was isolated from Arabidopsis by map-based cloning. 

30 The act1,dgd1 double mutant analysis strongly suggests that DGD1 also plays a 
critical role in lipid trafficking of ER-derived thylakoid lipids in higher plants. The 
availability of the wild-type DGD 7 gene, the similar DGD2 gene, as well as genes 
encoding monogalactosyl lipid synthases of Arabidopsis will permit the rigorous 
testing of the current hypothesis for galactolipid biosynthesis and subcellular lipid 

35 trafficking described herein. 
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The foregoing discussion discloses and describes merely exemplary 
embodiments of the present invention. One skilled in the art will readily recognize 
from such discussion, and from the accompanying drawings, that various changes r 
modifications and variations can be made therein without departing from the spirit 
5 and scope of the invention. 

Patent and literature references cited herein are incorporated by reference 
as if fully set forth. 



-1- 

SEQUENCE LISTING 

<110> Benning, Christoph 
Doermann, Peter 

<120> DNA Encoding For Plant Digalactosydiacylglycerol 
Galactosyltransferase And Methods of Use 

<130> 6550-000042 

<140> 
<141> 

<160> 8 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 2649 
<212> DNA 

<213> Arabidopsis thaliana 



<400> 1 
gaattcggca 


cgaggcagca 


ctctcacgaa 


atcgtcgtga 


cgaaacgata aaccctaagt 


60 


catggtaaag 


gaaactctaa 


ttcctccgtc 


atctacgtca 


atgacgaccg gaacatcttc 


120 


ttcttcgtct 


ctttcaatga 


cgttatcctc 


aacaaacgcg 


ttatcgtttt tgtcgaaagg 


180 


atggagagag 


gtatgggatt 


cagcagatgc 


ggatttgcag 


ctgatgcgag acagagctaa 


240 


ctctgttaag 


aatctagcat 


caacgttcga 


tagagagatc 


gagaatttcc tcaataactc 


300 


ggcgaggtct 


gcgtttcccg 


ttggttcacc 


atcggcgtcg 


tctttctcaa atgaaattgg 


360 


tatcatgaag 


aagcttcagc 


cgaagatttc 


ggagtttcgt 


agggtttatt cggcgccgga 


420 


gattagtcgc 


aaggttatgg 


agagatgggg 


acctgcgaga 


gcgaagcttg gaatggatct 


480 


atcggcgatt 


aagaaggcga 


ttgtgtctga 


gatggaattg 


gatgagcgtc agggagtttt 


540 


ggagatgagt 


agattgagga 


gacggcgtaa 


tagtgatagg 


gttaggttta cggagttttt 


600 


cgcggaggct 


gagagagatg 


gagaagctta 


tttcggtgat 


tgggaaccga ttaggtcttt 


660 


gaagagtaga 


tttaaagagt 


ttgagaaacg 


aagctcgtta 


gaaatattga gtggattcaa 


720 


gaacagtgaa 


tttgttgaga 


agctcaaaac 


cagctttaaa 


tcaatttaca aagaaactga 


780 


tgaggctaag 


gatgtccctc 


cgttggatgt 


acctgaactg 


ttggcatgtt tggttagaca 


840 


atctgaacct 


tttcttgatc 


agattggtgt 


tagaaaggat 


acatgtgacc gaatagtaga 


900 


aagcctttgc 


aaatgcaaga 


gccaacaact 


ttggcgtctg 


ccatctgcac aagcatccga 


960 


tttaattgaa 


aatgataacc 


atggagttga 


tttggatatg 


aggatagcca gtgttcttca 


1020 


aagcacagga 


caccattatg 


atggtgggtt 


ttggactgat 


tttgtgaagc ctgagacacc 


1080 


ggaaaacaaa 


aggcatgtgg 


caattgttac 


aacagctagt 


cttccttgga tgaccggaac 


1140 


agctgtaaat 


ccgctattca 


gagcggcgta 


tttggcaaaa 


gctgcaaaac agagtgttac 


1200 


tctcgtggtt 


ccttggctct 


gcgaatctga 


tcaagaacta 


gtgtatccaa acaatctcac 


1260 


cttcagctca 


cctgaagaac 


aagagagtta 


tatacgtaaa 


tggttggagg aaaggattgg 


1320 


tttcaaggct 


gattttaaaa 


tctcctttta 


cccaggaaag 


ttttcaaaag aaaggcgcag 


1380 
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catatttcct 


gctggtgaca 


cttctcaatt 


tatatcgtca 


aaagatgccg 


acattgctat 


1440 


acttgaagaa 


cctgaacatc 


tcaactggta 


ttatcacggc 


aagcgttgga 


ctgataaatc 


1500 


caaccatgtt 


gttggaattg 


tccacacaaa 


ctacttagag 


tacatcaaga 


gggagaagaa 


1560 


tggagctctt 


caagcatttt 


ttgtgaacca 


tgtaaacaat 


tgggtcacac 


gagcgtattg 


1620 


tgacaaggtt 


cttcgcctct 


ctgcggcaac 


acaagattta 


ccaaagtctg 


ttgtatgcaa 


1680 


tgtccatggt 


gtcaatccca 


agttccttat 


gattggggag 


aaaattgctg 


aagagagatc 


1740 


ccgtggtgaa 


caagctttct 


caaaaggtgc 


atacttctta 


ggaaaaatgg 


tgtgggctaa 


1800 


aggatacaga 


gaactaatag 


atctgatggc 


taaacacaaa 


agcgaacttg 


ggagcttcaa 


1860 


tctagatgta 


tatgggaacg 


gtgaagatgc 


agtcgaggtc 


caacgtgcag 


caaagaaaca 


1920 


tgacttgaat 


ctcaatttcc 


tcaaaggaag 


ggaccacgct 


gacgatgctc 


ttcacaagta 


1980 


caaagtgttc 


ataaacccca 


gcatcagcga 


tgttctatgc 


acagcaaccg 


cagaagcact 


2040 


agccatgggg 


aagtttgtgg 


tgtgtgcaga 


tcacccttca 


aacgaattct 


ttagatcatt 


2100 


cccgaactgc 


ttaacttaca 


aaacatccga 


agactttgtg 


tccaaagtgc 


aagaagcaat 


2160 


gacgaaagag 


ccactacctc 


tcactcctga 


acaaatgtac 


aatctctctt 


gggaagcagc 


2220 


aacacagagg 


ttcatggagt 


attcagatct 


cgataagatc 


ttaaacaatg 


gagagggagg 


2280 


aaggaagatg 


cgaaaatcaa 


gatcggttcc 


gagctttaac 


gaggtggtcg 


atggaggatt 


2340 


ggcattctca 


cactatgttc 


taacagggaa 


cgatttcttg 


agactatgca 


ctggagcaac 


2400 


accaagaaca 


aaagactatg 


ataatcaaca 


ttgcaaggat 


ctgaatctcg 


taccacctca 


2460 


cgttcacaag 


ccaatcttcg 


gctggtagat 


atttccccat 


aggccaccca 


gttattgctt 


2520 


gtgacttatt 


aaaccactac 


gttattgtta 


tcctttttta 


cttttacagt 


tgttgtaggt 


2580 


cgtttgtttg 


ttaatagaaa 


gggtagatta 


ttattaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


2640 


aaactcgag 
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<210> 2 
<211> 808 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Val Lys Glu Thr Leu He Pro Pro Ser Ser Thr Ser Met Thr Thr 
15 10 15 

Gly Thr Ser Ser Ser Ser Ser Leu Ser Met Thr Leu Ser Ser Thr Asn 
20 25 30 

Ala Leu Ser Phe Leu Ser Lys Gly Trp Arg Glu Val Trp Asp Ser Ala 
35 40 45 

Asp Ala Asp Leu Gin Leu Met Arg Asp Arg Ala Asn Ser Val Lys Asn 
50 55 60 

Leu Ala Ser Thr Phe Asp Arg Glu He Glu Asn Phe Leu Asn Asn Ser 
65 70 75 80 

Ala Arg Ser Ala Phe Pro Val Gly Ser Pro Ser Ala Ser Ser Phe Ser 
65 90 95 
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Asn Glu He Gly He Met Lys Lys Leu Gin Pro Lys He Ser Glu Phe 
100 105 HO 

Arg Arg Val Tyr Ser Ala Pro Glu He Ser Arg Lys Val Met Glu Arg 
115 120 125 

Trp Gly Pro Ala Arg Ala Lys Leu Gly Met Asp Leu Ser Ala He Lys 
130 " 135 140 

Lys Ala He Val Ser Glu Met Glu Leu Asp Glu Arg Gin Gly Val Leu 
145 150 155 160 

Glu Met Ser Arg Leu Arg Arg Arg Arg Asn Ser Asp Arg Val Arg Phe 
165 ' 170 175 

Thr Glu Phe Phe Ala Glu Ala Glu Arg Asp Gly Glu Ala Tyr Phe Gly 
180 185 190 

Asp Trp Glu Pro He Arg Ser Leu Lys Ser Arg Phe Lys Glu Phe Glu 
195 ~ 200 205 

Lys Arg Ser Ser Leu Glu He Leu Ser Gly Phe Lys Asn Ser Glu Phe 
210 215 220 

Val Glu Lys Leu Lys Thr Ser Phe Lys Ser He Tyr Lys Glu Thr Asp 
225 * 230 235 240 

Glu Ala Lys Asp Val Pro Pro Leu Asp Val Pro Glu Leu Leu Ala Cys 
245 250 255 

Leu Val Arg Gin Ser Glu Pro Phe Leu Asp Gin He Gly Val Arg Lys 
260 265 270 

Asp Thr Cys Asp Arg He Val Glu Ser Leu Cys Lys Cys Lys Ser Gin 
275 280 285 

Gin Leu Trp Arg Leu Pro Ser Ala Gin Ala Ser Asp Leu He Glu Asn 
290 ~ 295 300 

Asp Asn His Gly Val Asp Leu Asp Met Arg He Ala Ser Val Leu Gin 
305 310 315 320 

Ser Thr Gly His His Tyr Asp Gly Gly Phe Trp Thr Asp Phe Val Lys 
325 330 335 

Pro Glu Thr Pro Glu Asn Lys Arg His Val Ala He Val Thr Thr Ala 
340 345 350 

Ser Leu Pro Trp Met Thr Gly Thr Ala Val Asn Pro Leu Phe Arg Ala 
355 360 365 

Ala Tyr Leu Ala Lys Ala Ala Lys Gin Ser Val Thr Leu Val Val Pro 
370 375 380 

Trp Leu Cys Glu Ser Asp Gin Glu Leu Val Tyr Pro Asn Asn Leu Thr 
385 * 390 395 400 

Phe Ser Ser Pro Glu Glu Gin Glu Ser Tyr He Arg Lys Trp Leu Glu 
405 410 415 

Glu Arg He Gly Phe Lys Ala Asp Phe Lys He Ser Phe Tyr Pro Gly 
420 * 425 430 

Lys Phe Ser Lys Glu Arg Arg Ser He Phe Pro Ala Gly Asp Thr Ser 
435 440 445 

Gin Phe He Ser Ser Lys Asp Ala Asp He Ala He Leu Glu Glu Pro 
450 455 460 
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Glu His Leu Asn Trp Tyr Tyr His Gly Lys Arg Trp Thr Asp Lys Phe 
465 470 475 480 

Asn His Val Val Gly lie Val His Thr Asn Tyr Leu Glu Tyr He Lys 
485 490 495 

Arg Glu Lys Asn Gly Ala Leu Ql-n Ala Phe Phe Val Asn His Val Asn 
500 505 510 

Asn Trp Val Thr Arg Ala Tyr Cys Asp Lys Val Leu Arg Leu Ser Ala 
515 520 525 

Ala Thr Gin Asp Leu Pro Lys Ser Val Val Cys Asn Val His Gly Val 
530 ~ 535 540 

Asn Pro Lys Phe Leu Met He Gly Glu Lys He Ala Glu Glu Arg Ser 
545 550 555 560 

Arg Gly Glu Gin Ala Phe Ser Lys Gly Ala Tyr Phe Leu Gly Lys Met 
565 570 575 

Val Trp Ala Lys Gly Tyr Arg Glu Leu He Asp Leu Met Ala Lys His 
580 585 590 

Lys Ser Glu Leu Gly Ser Phe Asn Leu Asp Val Tyr Gly Asn Gly Glu 
595 600 605 

Asp Ala Val Glu Val Gin Arg Ala Ala Lys Lys His Asp Leu Asn Leu 
610 615 620 

Asn Phe Leu Lys Gly Arg Asp His Ala Asp Asp Ala Leu His Lys Tyr 
625 630 635 640 

Lys Val Phe He Asn Pro Ser He Ser Asp Val Leu Cys Thr Ala Thr 
645 650 655 

Ala Glu Ala Leu Ala Met Gly Lys Phe Val Val Cys Ala Asp His Pro 
660 665 670 

Ser Asn Glu Phe Phe Arg Ser Phe Pro Asn Cys Leu Thr Tyr Lys Thr 
675 680 685 

Ser Glu Asp Phe Val Ser Lys Val Gin Glu Ala Met Thr Lys Glu Pro 
690 695 700 

Leu Pro Leu Thr Pro Glu Gin Met Tyr Asn Leu Ser Trp Glu Ala Ala 
705 710 715 720 

Thr Gin Arg Phe Met Glu Tyr Ser Asp Leu Asp Lys He Leu Asn Asn 
725 730 735 

Gly Glu Gly Gly Arg Lys Met Arg Lys Ser Arg Ser Val Pro Ser Phe 
740 ~ 745 750 

Asn Glu Val Val Asp Gly Gly Leu Ala Phe Ser His Tyr Val Leu Thr 
755 760 765 

Gly Asn Asp Phe Leu Arg Leu Cys Thr Gly Ala Thr Pro Arg Thr Lys 
770 " 775 780 

Asp Tyr Asp Asn Gin His Cys Lys Asp Leu Asn Leu Val Pro Pro His 
785 " 790 795 800 

Val His Lys Pro He Phe Gly Trp 
805 



<210> 3 
<211> 2500 
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<212> DNA 

<213> Arabidopsis thaliana 
<220> ^ 

<221> misc_feature 
<222> (467) . . (983) 
<223> exon 1 

<220> 

<221> misc feature 
<222> (1201) . . (1427) 
<223> exon 2 

<220> 

<221> miscjfeature 
<222> (1608) . . (1874) 
<223> exon 3 

<220> 

<221> misc_f eature 
<222> (1987) . . (2073) 
<223> exon 4 

<220> 

<221> misc_feature 
<222> (2261) . . (2332) 
<223> exon 5 

<400> 3 

tgtgggatta agattctata ttatcttttt 
gaaaagattc acaagatctt tattagaaga 
tgaaaagtaa accattaatg cacaggacac 
tttcttatta gcaattagat atggctctaa 
tgcgaattcg aaatgatatg tacagcttct 
ttataattca gactcgtgtc atcatacatg 
atcaatatta caccaaagct gctatagaat 
aaaaatatcc caatgatatg cagacaaaga 
gagtatttgg ggaaggagtg ttgagagaca 
gctgcaagct tccaggtata gcaccaaaag 
agaaatgaat atacgctgac atatcctcca 
caaacacact tctctttgac aagttcgaat 
cttttataaa ccgttgtgta gcagcttccc 
gttgcgatgg ttgttcccca agagccttga 
cataggttct gcagttggga aactgtttga 
ctactatttt tcccatcgcc aaggcttctg 
tggggttgag aaacacttta tagctgaaaa 
ccgatagact gcaaggaaat caatcaatct 
tgaaatagcg agtcagcgtg atcacgtcct 
cgggctgctt ctttgatctc ttcagagtcc 



gataataaaa 


acaaaaatat 


tagcaaaata 


60 


aaagtgaatg 


gtatgcataa 


ataaaatcca 


120 


gagttactca 


tagttcatat 




180 


tatgagcaat 


tggccacata 


agtggacaaa 


240 


gattctttta 


atggaatcat 


atactaaatt 


300 


ttccacaaga 


tgaaaaatct 


aactttcatc 


360 


gattcagata gtatacattg 


atttatacac 


420 


taaattaaaa 


aacaggtcaa 


tcttgcttgc 


480 


aaccaagatc 


tctacataac 


tcttcatcgg 


540 


ctgttctgga 


agcttcaaat 


ccagacgcca 


600 


agtttttccc 


cacactaatc 


gaagatgacg 


660 


cagctcttga 


taaccggttc 


agatcagaga 


720 


atgatagttc 


atgcctttgt 


tgctctgtaa 


780 


gcgtggctct 


tacaaaacct 


tgtccatcgt 


840 


agaacttgtt 


cgatatgtga 


ttcgcgcata 


900 


cagttgttgt 


acacacaacg 


tctgtcgtgc 


960 


tagaagatgg 


aaaagtttaa 


ggtatagaat 


1020 


tgaggctgtt 


aagaactaga 


gacttacttg 


1080 


gggtaaacat 


taaccgtcaa 


gtcgagtttt 


1140 


tctccatcac 


cgtataaatc 


aacctctaac 


1200 



tcggcaagtt ccttttggtg tttctcaagt agtttaagaa gctccttgta ccctttgctc 1260 
cagaccatct taccaatgta gtatgcgcct ttagtgaagg gctgctcctg gagcttctgc 1320 
tgttctagtt ttctcaaccc aatttcgaga aatttagggt tgacgccatg aacattgcag 1380 
actatagatt tagggtattc ttgagtcgca gcagataacc tgattacctg caggagtaac 1440 
atcattggta caagtcctga ttcacatcaa aaggaccggc cataattgtg gtcacattcc 1500 
aacaaactgt cttgattctc aaattaaacc cagccattgg atatattcac ataattgagc 1560 
aatcactcta tcatgtccta gtggctgtaa agaagaaaga agtttacctt gtggcagtaa 1620 
atgccaacaa cccaactatt taagtatttg aggaaaaatg ctttgacacg gccttgtttc 1680 
tctcttttaa cgtattccaa gtagttagtg tgtacgattc ctatgacgta gttgaactta 1740 
gttttccatt tttggccatg atgaaaccat gtgagatgct caggctcctc gaggacagca 1800 
atgtctgcct cttcatcagg aatggcatca gatatatccc caacaggaag aatactcctt 1860 
ttgtcaatag caaactgcca taagataaat ttgaattgct aaggttggca caacacagtc 192 0 
acagagtact acgtaaaaga aatggaagaa ttcaactcta tatcacacgg aaatgaaaca I960 
atcgactcta tgaaaggaag cagtatgtgt ttggtaccca aaagaagatt cttgagagtc 204 0 
atacctttcc aggatagaaa cgtatctcaa aggctaaacg aaaagagact ctctcctcaa 2100 
gccactggcg gacataagct tcttgctctg acggggaact aaaagtgatg ctatttgggt 2160 
agacaagctt ttggtgcttc aaagtcagcc atggaatcac caacgtgacc cgtctttccc 2220 
catcatttgc aaggtaggca gcacggaaga gaggattaac agcagttccc gtaagccatg 2280 
gaatactagc tgttgtaaat atcgcaatgt gttgctcctg ctgattagtc atatcatctc 2340 
aagctaaacc tctacaaatt atcaaacaaa aagagtaaac caatagattc ttgtaattga 2400 
gtttgatctg attattgcat cttcccaata aggataacat tcgactacaa attcttaatt 24 60 
tttctgcaaa ttcaaacaat cttttcacac gattcaagcg 2500 

<210> 4 
<211> 389 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 4 

Met Thr Asn Gin Gin Glu Gin Hia lie Ala lie Phe Thr Thr Ala Ser 
15 10 15 

He Pro Trp Leu Thr Gly Thr Ala Pro Leu Arg Tyr Val Ser He Leu 
20 25 30 

Glu Arg Tyr Asp Ser Gin Glu Ser Ser Phe Gly Tyr Gin Thr His Thr 
35 40 45 

Ala Ser Phe His Arg Phe Ala He Asp Lys Arg Ser He Leu Pro Val 
50 55 60 

Gly Asp He Ser Asp Ala He Pro Asp Glu Glu Ala Asp He Ala Val 
65 70 75 BO 

Leu Glu Glu Pro Glu His Leu Thr Trp Phe His His Gly Gin Lys Trp 
85 90 95 
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Lys Thr Lys Phe Asn Tyr Val lie Gly He Val His Thr Asn Tyr Leu 
100 105 110 

Glu Tyr Val Lys Arg Glu Lys Gin Gly Arg Val Lys Ala Phe Phe Leu 
115 120 125 

Lys Tyr Leu Asn Ser Trp Val Val Gly He Tyr Cys His Lys Val He 
130 135 140 

Arg Leu Ser Ala Ala Thr Gin Glu Tyr Pro Lys Ser He Val Cys Asn 
145 150 155 160 

Val His Gly Val Asn Pro Lys Phe Leu Glu He Gly Leu Arg Lys Leu 
165 170 175 

Glu Gin Gin Lys Leu Gin Glu Gin Pro Phe Thr Lys Gly Ala Tyr Tyr 
180 185 190 

He Gly Lys Met Val Trp Ser Lys Gly Tyr Lys Glu Leu Leu Lys Leu 
195 200 205 

Leu Glu Lys His Gin Lys Glu Leu Ala Asp Tyr Lys Val Phe Leu Asn 
210 215 220 

Pro Ser Thr Thr Asp Val Val Cys Thr Thr Thr Ala Glu Ala Leu Ala 
225 230 235 240 

Met Gly Lys He Val Val Cys Ala Asn His He Ser Asn Lys Phe Phe 
245 250 255 

Lys Gin Phe Pro Asn Cys Arg Thr Tyr Asp Asp Gly Gin Gly Phe Val 
260 265 270 

Arg Ala Thr Leu Lys Ala Leu Gly Glu Gin Pro Ser Gin Leu Thr Glu 
275 280 285 

Gin Gin Arg His Glu Leu Ser Trp Glu Ala Ala Thr Gin Arg Phe He 
290 295 300 



Lys Val Ser Asp Leu Asn Arg Leu Ser Arg Ala Asp Ser Asn Leu Ser 
305 ' 310 315 320 

Lys Arg Ser Val Phe Ala Ser Ser Ser He Ser Val Gly Lys Asn Leu 
325 330 335 

Glu Asp Met Ser Ala Tyr He His Phe Leu Ala Ser Gly Phe Glu Ala 
340 345 350 

Ser Arg Thr Ala Phe Gly Ala He Pro Gly Ser Leu Gin Pro Asp Glu 
355 360 365 

Glu Leu Cys Arg Asp Leu Gly Leu Ser Leu Asn Thr Pro Ser Pro Asn 
370 375 380 

Thr Arg Lys Gin Asp 
385 



<210> 5 

<211> 5128 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> misc_feature 

<222> (627) . . (1369) 

<223> exon 1 



<220> 



<221> misc_f eature 
<222> (1565) . . (1597) 
<22 3> exon 2 

<220> 

<221> misc_feature 
<222> (1712) . . (1798) 
<223> exon 3 

<220> 

<221> misc_feature 
<222> (2186) . . (2668) 
<223> exon 4 

<220> 

<221> misc_feature 
<222> (2769) . . (3035) 
<223> exon 5 

<220> 

<221> misc feature 
<222> (3137) . . (3487) 
<223> exon 6 

<220> 

<221> misc_feature 
<222> (3570) . . (4207) 
<223> exon 7 

<220> 

<221> gene 

<222> (674) . . (4079) 

<223> coding region for DGD1 

<400> 5 

aagctttaac gagagcgatc aaaggcaaat 
cgaccgtgat attttgtatg atatatatat 
attgtataat ggtatcaaaa tttgctaaat 
gtccgttggt tgttttggtt aagtcggcct 
ggtctaatta cgtccactga tatcactttc 
aataccacgc cccgtaggac cagattcttc 
gacaatcgga taatctcgtc aaccactcac 
catgccacgt tggacacggt ctcacgggac 
tccgacttgt ttattttcca cgtgttaatc 
aataaaacaa tccaaattaa tctcagccgt 
ctcaacagat agagaaaacc ttattagcag 
taaaccctaa gtcatggtaa aggaaactct 
cggaacatct tcttcttcgt ctctttcaat 
tttgtcgaaa ggatggagag aggtatggga 
agacagagcc aactctgtta agaatctagc 
cctcaataac tcggcgaggt ctgcgtttcc 
aaatgaaatt ggtatcatga agaagcttca 
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cactagaaac 


tcttgagcaa gagttcatgt 


60 


ggtcaagatg 


gtgaagacaa aggttctact 


120 


ttctaacaat 


ttcacaaggt 


ctcgcgtggc 


180 


ctattttgat 


ccgttaatgg aactcgttac 


240 


caatgttttt 


ttttgttaaa 


tgtcattgtc 


300 


gattccagca 


tattcctaag 


ttaaatccga 


360 


aacacgccac 


gtaatcaatg 


atgtggaacc 


420 


aaggcttcag 


taaagagcgc 


ctatcgtccg 


480 


ttccagaata 


atcagaaaga 


gaattaaaaa 


540 


caatttctct 


tcttcttctt 


cccaaattct 


600 


cactctcacg 


aaatcgtcgt 


gacgaaacga 


660 


aattcctccg 


tcatctacgt 


caatgacgac 


720 


gacgttatcc 


tcaacaaacg 


cgttatcgtt 


780 


ttcagcagat 


gcggatttgc 


agctgatgcg 


640 


atcaacgttc 


gatagagaga 


tcgagaattt 


900 


cgttggttca 


ccatcggcgt 


cgtctttctc 


960 


gccgaagatt 


tcggagtttc 


gtagggttta 


1020 
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ttcggcgccg gagattagtc gcaaggttat 
tggaatggat ctatcggcga ttaagaaggc 
tcagggagtt ttggagatga gtagattgag 
tacggagttt ttcgcggagg ctgagagaga 
gattaggtct ttgaagagta gatttaaaga 
gagtggattc aagaacagtg aatttgttga 
caactttttg ggaacctatt tcaaaagttt 
ttcaatagcc acgacatttt cgtatgctga 
ttgagtttct tatggtgttt ggcaactttt 
gcagaaatca atttacaaag aaactgatga 
ctatcagttt ttcttgttac tttatatatt 
tttaacttgg cctcctttta ctttggtgca 
gttggcatgt ttggttagac aatctgaacc 
taagattgca tttttctctt catgatggtt 
gtttctctac cataggtggt ttttccgtca 
tgtctttgtt aagtgggttg cttgcttcac 
tgcattatca cactaatcct ggatatcaaa 
gatcaatcaa atgacatctg ccgtagacat 
ttaaacaccc tttcttgtac attatcttat 
ttctgttctg ctgaattttc tttcagatac 
atgcaagagc caacaacttt ggcgtctgcc 
tgataaccat ggagttgatt tggatatgag 
ccattatgat ggtgggtttt ggactgattt 
gcatgtggca attgttacaa cagctagtct 
gctattcaga gcggcgtatt tggcaaaagc 
ttggctctgc gaatctgatc aagaactagt 
tgaagaacaa gagagttata tacgtaaatg 
ttttaaaatc tccttttacc caggaaaggt 
tttctatggc tgccaatatg tttttcaatt 
tcttatagtt ttcaaaagaa aggcgcagca 
tatcgtcaaa agatgctgac attgctatac 
atcacggcaa gcgttggact gataaattca 
acttagagta catcaagagg gagaagaatg 
caaacaattg ggtcacacga gcgtattgtg 
agccttgttc tgttgcttga atcctcttta 



ggagagatgg ggacctgcga gagcgaagct 1080 
gattgtgtct gagatggaat tggatgagcg 114 0 
gagacggcgt aatagtgata gggttaggtt 1200 
tggagaagct tatttcggtg attgggaacc 1260 
gtttgagaaa cgaagctcgt tagaaatatt 1320 
gaagctcaaa accagctttg taagtttctc 13 80 
cttctatctt actgtagaag tggcttctct 1440 
cttgatagtt actgttcttg gaatgattga 1500 
ctaatggtgc ttttttcttt tgttttattt 1560 
ggctaaggtt ggtatttgga agttggagct 1620 
tttaggtcga gttcagtgtt ggattgggag 1680 
ggatgtccct ccgttggatg tacctgaact 1740 
ttttcttgat cagattggtg ttagaaaggg 1800 
aattattttg tctgttgtat gtattgagtt 1860 
aaagtttgaa tcttctctct gatattagag 1920 
cagaagttta gatggtgaga tttgatgttc 1980 
tatgtgtaag tctagattct gtatgagaca 2040 
aaaaatttct agatgtgtag gttattggtt 2100 
agtttcagtg tttacataca aagttcctga 2160 
atgtgaccga atagtagaaa gcctttgcaa 2220 
atctgcacaa gcatccgatt taattgaaaa 2280 
gatagccagt gttcttcaaa gcacaggaca 2340 
tgtgaagcct gagacaccgg aaaacaaaag 2400 
tccttggatg accggaacag ctgtaaatcc 2460 
tgcaaaacag agtgttactc tcgtggttcc 2520 
gtatccaaac aatctcacct tcagctcacc 2580 
gttggaggaa aggattggtt tcaaggctga 2640 
atgttgatca ttttggattc tattttttta 2700 
atttctatag agtaactgag ctttctggtt 2760 
tatttcctgc tggtgacact tctcaattta 2820 
ttgaagaacc tgaacatctc aactggtatt 2880 
accatgttgt tggaattgtc cacacaaact 2940 
gagctcttca agcatttttt gtgaaccatg 3000 
acaaggtgaa tcatctactc tatttcttca 3060 
ctaataaata gtacacgagc taatacatat 3120 



1 
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tttctactca 


tgaaaggttc 


ttcgcctctc 


tgcggcaaca 


caagatttac 


caaagtctgt 


3180 


tgtatgcaat 


gtccatggtg 


tcaatcccaa 


gttccttatg 


attggggaga 


aaattgctga 


3240 


agagagatcc 


cgtggtgaac 


aagctttctc 


aaaaggtgca 


tacttcttag 


gaaaaatggt 


3300 


gtgggctaaa 


ggatacagag 


aactaataga 


tctgatggct 


aaacacaaaa 


gcgaacttgg 


3360 


gagcttcaat 


ctagatgtat 


atgggaacgg 


tgaagatgca 


gtcgaggtcc 


aacgtgcagc 


3420 


aaagaaacat 


gacttgaatc 


tcaatttcct 


caaaggaagg 


gaccacgctg 


acgatgctct 


3480 


tcacaagtaa 


gttctgaaaa 


atgtgctttg 


cttttaaaaa 


cttgttaagg 


tttcgctctt 


3540 


tgattgtctt 


tcccacatct 


tgatgaaggt 


acaaagtgtt 


cataaacccc 


agcatcagcg 


3600 


atgttctatg 


cacagcaacc 


gcagaagcac 


tagccatggg 


gaagtttgtg 


gtgtgtgcag 


3660 


atcacccttc 


aaacgaattc 


tttagatcat 


tcccgaactg 


cttaacttac 


aaaacatccg 


3720 


aagactttgt 


gtccaaagtg 


caagaagcaa 


tgacgaaaga 


gccactacct 


ctcactcctg 


3780 


aacaaatgta 


caatctctct 


tgggaagcag 


caacacagag 


gttcatggag 


tattcagatc 


3840 


tcgataagat 


cttaaacaat 


ggagagggag 


gaaggaagat 


gcgaaaatca 


agatcggttc 


3900 


cgagctttaa 


cgaggtggtc 


gatggaggat 


tggcattctc 


acactatgtt 


ctaacaggga 


3960 


acgatttctt 


gagactatgc 


actggagcaa 


caccaagaac 


aaaagactat 


gataatcaac 


4020 


attgcaagga 


tctgaatctc 


gtaccacctc 


acgttcacaa 


gccaatcttc 


ggctggtaga 


4080 


tatttcccca 


taggccaccc 


agttattgct 


tgtgacttat 


taaaccacta 


cgttattgtt 


4140 


atcctttttt 


acttttacag 


ttgttgtagg 


tcgtttgttt 


gttaatagaa 


agggtagatt 


4200 


attattagat 


gtctttttgt 


aaaatatcaa 


tacgaagcgt 


atttgatgat 


atataaaata 


4260 


actatattgg 


caaaaatatg 


aactatgaag 


gccgttttcg 


tgattttgtt 


cttttgtttc 


4320 


acgaattcaa 


gctattcccc 


tttttttacg 


ccaaagatga 


aaagaaccct 


ccctattaat 


4380 


atcgctattg 


tctaaaattt 


cgaaaactac 


tttaatcacg 


actagaccaa 


atatatgtcg 


4440 


accgataccg 


atagagaaat 


tagtgccccg 


tctaatactt 


tctctccaaa 


attacagaat 


4500 


atttagagta 


gttaatcaac 


gtaacacgac 


aaggaaaatg 


atggaaaaag 


tggtggtttc 


4560 


tgctttggca 


actagtgtta 


ggtcacttac 


gtcctctttt 


tctgtattgg 


aaattacgtg 


4620 


gataaattga 


actttctttc 


aatctctatc 


aaattattaa 


tccacacatg 


tatacgcaat 


4680 


atatgatcat 


taaataaata 


aaaagttaga 


ttggtctata 


aattcgtatc 


acaaatggac 


4740 


taataatttg 


tagtgaaaac 


tcatttaccc 


atgtgacagc 


tccaaatttc 


tgaacttttt 


4800 


attttgaggg 


atggtacaaa 


tccgagttcc 


atgatcatgg 


aaaaatcaaa 


tttaacaaac 


4860 


acaaattact 


gtttgaaaca 


agcaagttac 


tatatatgta 


gtttgatttc 


acactagaga 


4920 


atctactgat 


aatgaatttt 


ttatatatcg 


tgaagctgaa 


agtgaaatta 


taactaacta 


4980 


gttgaattgc 


ttattatggt 


tggaggggag 


ccgatgaaaa 


attcttcgac 


cacacataaa 


5040 


gtcacctttc 


taaagaacac 


ttgcaccgac 


cacatcaatc 


acgcattcat 


aaattttcaa 


5100 


catttatata 


aatgtaggaa 


aaaaacaa 








5128 
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<210> 6 
<211> 5050 
<212> DNA 

<213> Arabidopais thaliana 
<220> 

<221> mutation 
<222> (334) 

<223> T to C exchange (5' non-transcribed region) 
<220> 

<22l> mutation 
<222> (3189) 

<223> C to T exchange (CAA to TAA r premature stop codon) 
<220> 

<221> mutation 
<222> (3434) 

<223> a insertion after 3434 (silent, in intron) 
<220> 

<221> mutation 
<222> (3649) 

<223> C to T exchange (conservative exchange, Ser codon) 
<220> 

<221> mutation 
<222> (4795) 

<223> A insertion after 4795 (3' non-transcribed 
region) 

<220> 

<221> mutation 
<222> (4868) 

<223> T to A exchange (3 1 non-transcribed region) 
<220> 

<221> mutation 
<222> (4874) 

<223> A to T exchange (3* non-transcribed region) 



<400> 6 

ttttgtatga tatatatatg gtcaagatgg 
gtatcaaaat ttgctaaatt tctaacaatt 
gttttggtta agtcggcctc tattttgatc 
gtccactgat atcactttcc aatgtttttt 
ccgtaggacc agattcttcg attccagcat 
aatctcgtca accactcaca acacgccacg 
ggacacggtc tcacgggaca aggcttcagt 
tattttccac gtgttaatct tccagaataa 
ccaaattaat ctcagccgtc aatttctctt 
gagaaaacct tattagcagc actctcacga 
tcatggtaaa ggaaactcta attcctccgt 
cttcttcgtc tctttcaatg acgttatcct 
gatggagaga ggtatgggat tcagcagatg 



tgaagacaaa 


ggttctacta 


ttgtataatg 


60 


tcacaaggtc 


tcgcgtggcg 


tccgttggtt 


120 


cgttaatgga 


actcgttacg 


gtctaattac 


180 


tttgttaaat 


gtcattgtca 


ataccatgcc 


240 


attcctaagt 


taaatccgac 


acaatcggat 


300 


taaccaatga 


tgtggaaccc 


atgccacgtt 


360 


aaagagcgcc 


tatcgtccgt 


ccgacttgtt 


420 


tcagaaagag 


aattaaaaaa 


ataaaacaat 


480 


cttcttcttc 


ccaaattctc 


tcaacagata 


540 


aatcgtcgtg 


acgaaacgat 


aaaccctaag 


600 


catctacgtc 


aatgacgacc 


ggaacatctt 


660 


caacaaacgc 


gttatcgttt 


ttgtcgaaag 


720 


cggatttgca 


gctgatgcga 


gacagagcta 


780 



-12- 



actctgttaa gaatctagca tcaacgttcg 
cggcgaggtc tgcgtttccc gttggttcac 
gtatcatgaa gaagcttcag ccgaagattt 
agattagtcg caaggttatg gagagatggg 
tatcggcgat taagaaggcg attgtgtctg 
tggagatgag tagattgagg agacggcgta 
tcgcggaggc tgagagagat ggagaagctt 
tgaagagtag atttaaagag tttgagaaac 
agaacagtga atttgttgag aagctcaaaa 
gaacctattt caaaagtttc ttctatctta 
cgacattttc gtatgctgac ttgatagtta 
atggtgtttg gcaacttttc taatggtgct 
tttacaaaga aactgatgag gctaaggttg 
tcttgttact ttatatattt ttaggtcgag 
ctccttttac tttggtgcag gatgtccctc 
tggttagaca atctgaacct tttcttgatc 
ttttctcttc atgatggtta attattttgt 
ataggtggtt tttccgtcaa aagtttgaat 
agtgggttgc ttgcttcacc agaagtttag 
actaatcctg gatatcaaat atgtgtaagt 
tgacatctgc cgtagacata aaaatttcta 
ttcttgtaca ttatcttata gtttcagtgt 
tgaattttct ttcagataca tgtgaccgaa 
aacaactttg gcgtctgcca tctgcacaag 
gagttgattt ggatatgagg atagccagtg 
gtgggttttg gactgatttt gtgaagcctg 
ttgttacaac agctagtctt ccttggatga 
cggcgtattt ggcaaaagct gcaaaacaga 
aatctgatca agaactagtg tatccaaaca 
agagttatat acgtaaatgg ttggaggaaa 
ccttttaccc aggaaaggta tgttgatcat 
gccaatatgt ttttcaatta tttctataga 
tcaaaagaaa ggcgcagcat atttcctgct 
gatgctgaca ttgctatact tgaagaacct 



atagagagat cgagaatttc ctcaataact 840 
catcggcgtc gtctttctca aatgaaattg 900 
cggagtttcg tagggtttat tcggcgccgg 960 
gacctgcgag agcgaagctt ggaatggatc 102 0 
agatggaatt ggatgagcgt cagggagttt 1080 
atagtgatag ggttaggttt acggagtttt 114 0 
atttcggtga ttgggaaccg attaggtctt 12 00 
gaagctcgtt agaaatattg agtggattca 1260 
ccagctttgt aagtttctcc aactttttgg 1320 
ctgtagaagt ggcttctctt tcaatagcca 1380 
ctgttcttgg aatgattgat tgagtttctt 1440 
tttttctttt gttttatttg cagaaatcaa 1500 
gtatttggaa gttggagctc tatcagtttt 1560 
ttcagtgttg gattgggagt ttaacttggc 1620 
cgttggatgt acctgaactg ttggcatgtt 1680 
agattggtgt tagaaagggt aagattgcat 1740 
ctgttgtatg tattgagttg tttctctacc 1800 
cttctctctg atattagagt gtctttgtta 1860 
atggtgagat ttgatgttct gcattatcac 1920 
ctagatcccg tatgagacag atcaatcaaa 1980 
gatgtgtagg ttattggttt taaacaccct 2040 
ttacatacaa agttcctgat tctgttctgc 2100 
tagtagaaag cctttgcaaa tgcaagagcc 2160 
catccgattt aattgaaaat gataaccatg 2220 
ttcttcaaag cacaggacac cattatgatg 22 80 
agacaccgga aaacaaaagg catgtggcaa 234 0 
ccggaacagc tgtaaatccg ctattcagag 2400 
gtgttactct cgtggttcct tggctctgcg 2460 
atctcacctt cagctcacct gaagaacaag 2520 
ggattggttt caaggctgat tttaaaatct 2580 
tttggattct atttttttat ttctatggct 2640 
gtaactgagc tttctggttt cttatagttt 2700 
ggtgacactt ctcaatttat atcgtcaaaa 2760 
gaacatctca actggtatta tcacggcaag 2820 
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cgttggactg 


ataaattcaa 


ccatgttgtt ggaattgtcc 


acacaaacta 


cttagagtac 


2880 


atcaagaggg 


agaagaatgg 


agctcttcaa gcattttttg 


tgaaccatgt 


aaacaattgg 


2940 


gtcacacgag 


cgtattgtga 


caaggtgaat catctactct 


atttcttcaa 


gccttgttct 


3000 


gttgcttgaa 


tcctctttac 


taataaatag tacacgagct 


aatacatatt 


ttctactcat 


3060 


gaaaggttct 


tcgcctctct 


gcggcaacac aagatttacc 


aaagtctgtt 


gtatgcaatg 


3120 


tccatggtgt 


caatcccaag 


ttccttatga ttggggagaa 


aattgctgaa 


gagagatccc 


3180 


gtggtgaata 


agctttctca 


aaaggtgcat acttcttagg 


aaaaatggtg 


tgggctaaag 


3240 


gatacagaga 


actaatagat 


ctgatggcta aacacaaaag 


cgaacttggg 


agcttcaatc 


3300 


tagatgtata 


tgggaacggt 


gaagatgcag tcgaggtcca 


acgtgcagca 


aagaaacatg 


3360 


acttgaatct 


caatttcctc 


aaaggaaggg accacgctga 


cgatgctctt 


cacaagtaag 


3420 


ttctgaaaaa 


tgtggctttg 


cttttaaaaa cttgttaagg 


tttcgctctt 


tgattgtctt 


3480 


tcccacatct 


tgatgaaggt 


acaaagtgtt cataaacccc 


agcatcagcg 


atgttctatg 


3540 


cacagcaacc 


gcagaagcac 


tagccatggg gaagtttgtg 


gtgtgtgcag 


atcacccttc 


3600 


aaacgaattc 


tttagatcat 


tcccgaactg cttaacttac 


aaaacatctg 


aagactttgt 


3660 


gtccaaagtg 


caagaagcaa 


tgacgaaaga gccactacct 


ctcactcctg 


aacaaatgta 


3720 


caatctctct 


tgggaagcag 


caacacagag gttcatggag 


tattcagatc 


tcgataagat 


37B0 


cttaaacaat 


ggagagggag 


gaaggaagat gcgaaaatca 


agatcggttc 


cgagctttaa 


3840 


cgaggtggtc 


gatggaggat 


tggcattctc acactatgtt 


ctaacaggga 


acgatttctt 


3900 


gagactatgc 


actggagcaa 


caccaagaac aaaagactat 


gataatcaac 


attgcaagga 


3960 


tctgaatctc 


gtaccacctc 


acgttcacaa gccaatcttc 


ggctggtaga 


tatttcccca 


4020 


taggccaccc 


agttattgct 


tgtgacttat taaaccacta 


cgttattgtt 


atcctttttt 


4080 


acttttacag 


ttgttgtagg 


tcgtttgttt gttaatagaa 


agggtagatt 


attattagat 


4140 


gtctttttgt 


aaaatatcaa 


tacgaagcgt atttgatgat 


atataaaata 


actatattgg 


4200 


caaaaatatg 


aactatgaag 


gccgttttcg tgattttgtt 


cttttgtttc 


acgaattcaa 


4260 


gctattcccc 


tttttttacg 


ccaaagatga aaagaaccct 


ccctattaat 


atcgctattg 


4320 


tctaaaattt 


cgaaaactac 


tttaatcacg actagaccaa 


atatatgtcg 


accgataccg 


4380 


atagagaaat 


tagtgccccg 


tctaatactt tctctccaaa 


attacagaat 


atttagagta 


4440 


gttaatcaac 


gcaacacgac 


aaggaaaatg atggaaaaag 


tggtggtttc 


tgctttggca 


4500 


actagtgtta 


ggtcacttac 


gtcctctttt tctgtattgg 


aaattacgtg 


gataaattga 


4560 


actttctttc 


aatctctatc 


aaattattaa tccacacatg 


tatacgcaat 


atatgatcat 


4620 


taaataaata 


aaaagttaga 


ttggtctata aattcgtatc 


acaaatggac 


taataatttg 


4680 


tagtgaaaac 


tcatttaccc 


atgtgacagc tccaaatttc 


tgaacttttt 


attttgaggg 


4740 


atggtacaaa 


tccgagttcc 


atgatcatgg aaaaatcaaa 


tttaacaaac 


acaaaattac 


4800 


tgtttgaaac 


aggcaagtta 


ctatatatgt agtttgattt 


cacactagag 


aatctactga 


4660 


taatgaaatt 


tttttatatc 


gtgaagctga aagtgaaatt 


ataactagct 


agttgaattg 


4920 



1 
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cttattatgg ttggagggga gccgatgaaa aattcttcga ccacacataa agtcaccttt 4980 
ctaaagaaca cttgcaccga ccacatcaat cacgcattca taaattttca acatttatat 5040 
aaatgtagga 5050 



<210> 7 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :01igonucleotide 
primer 

<400> 7 

gcggatccgg taaaggaaac tctaatt 27 

<210> 8 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
primer 



<400> 8 

ttctgcagtc taccagccga agattgg 



27 



15 



WE CLAIM: 

1 . An isolated nucleic acid molecule comprising a nucleic acid 
sequence encoding for a polypeptide having digalactosyldiacylglycerol 
galactosyltransferase activity, 

2. The nucleic acid molecule of Claim 1 wherein the nucleic acid 
molecule has a nucleotide sequence comprising SEQ ID NO: 1. 

3. The isolated nucleic acid molecule of Claim 1 wherein the 
nucleotide sequence encodes a polypeptide having an amino acid sequence 
comprising SEQ ID NO: 2. 

4. A vector comprising the nucleic acid of Claim 1 . 

5. A host cell comprising the vector of Claim 4. 

6. The host cell of Claim 5 wherein the host cell is a plant cell. 

7. An isolated nucleic acid molecule comprising a nucleotide 
sequence capable of hybridizing under high stringency conditions to SEQ ID NO: 
1 or the complement of SEQ ID NO: 1 . 

8. The nucleic acid molecule of Claim 7, wherein the nucleic acid 
molecule is capable of changing the lipid composition of a plant cell. 

9. A vector comprising the nucleic acid of Claim 7. 

10. A host cell comprising the vector of Claim 9. 

1 1 . The host cell of Claim 10 wherein the host cell is a plant cell. 
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12. A method for producing a polypeptide encoded by the nucleic acid 
molecule of Claim 1 comprising the steps of: 

a) inserting the nucleic acid molecule into a vector; 

b) transforming a host cell with the vector; and 

c) growing a culture of the host cell in a suitable culture medium. 

1 3. The method of Claim 1 2 wherein the host cell is £. coli. 

14. The method of Claim 12 wherein the host cell is yeast. 

15. A polypeptide comprising digalactosyldiacylglycerol 
galactosyltransf erase activity. 

16. The polypeptide of Claim 15 wherein the polypeptide has an amino 
acid sequence comprising SEQ ID NO: 2. 

17. The polypeptide of Claim 15 wherein the polypeptide is encoded 
for by a nucleic acid molecule having a nucleotide sequence comprising SEQ ID 
NO: 1. 

18. The polypeptide of Claim 15 wherein the polypeptide is encoded 
for by a nucleic acid molecule having a nucleotide sequence comprising SEQ ID 
NO: 3. 

19. A method for altering leaf lipid composition in a plant comprising 
introducing a nucleic acid molecule encoding for a polypeptide comprising 
digalactosyldiacylglycerol galactosyltransferase activity into a plant. 

20. The method of Claim 19 wherein the nucleic acid molecule 
comprises SEQ ID NO: 1. 

21. The method of Claim 19 wherein the nucleic acid molecule 
comprises SEQ ID NO: 3. 

22. A transgenic plant produced by the method of Claim 1 9. 
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23. The seeds of the plant of Claim 22. 

24. A transgenic plant produced by breeding the plant of Claim 22 
wherein the plant retains the trait of altered leaf lipid composition. 

25. A method for determining inhibitors of digalactosyldiacylglycerol 
galactosyltransferase comprising the steps of: 

a) contacting the galactosyltransferase with a compound; and 

b) determining any change in activity of the galactosyltransferase. 

26. The method of Claim 25 wherein the digalactosyldiacylglycerol 
galactosyltransferase has an amino acid sequence comprising SEQ ID NO: 2. 

27. The method of Claim 25 wherein the digalactosyldiacylglycerol 
galactosyltransferase has an amino acid sequence comprising SEQ ID NO: 4. 
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